DROPS

Found 2 Possible Name Variants:

Pereira, José Casimiro
Pereira, José

Pereira, José Casimiro

Document

DOI: 10.4230/OASIcs.SLATE.2017.20

Natural Transmission of Information Extraction Results to End-Users - A Proof-of-Concept Using Data-to-Text

Authors: José Casimiro Pereira, António J. S. Teixeira, Mário Rodrigues, Pedro Miguel, and Joaquim Sousa Pinto

Published in: OASIcs, Volume 56, 6th Symposium on Languages, Applications and Technologies (SLATE 2017)

Abstract

Information Extraction from natural texts has a great potential in areas such as Tourism and can be of great assistance in transforming customers' comments in valuable information for Tourism operators, governments and customers. After extraction, information needs to be efficiently transmitted to end-users in a natural way. Systems should not, in general, send extracted information directly to end-users, such as hotel managers, as it can be difficult to read. Naturally, humans transmit and encode information using natural languages, such as Portuguese. The problem arising from the need of efficient and natural transmission of the information to end-user is how to encode it. The use of natural language generation (NLG) is a possible solution, for producing sentences, and, with them, texts. In this paper we address this, with a data-to-text system, a derivation of formal NLG systems that use data as input. The proposed system uses an aligned corpus, which was defined, collected and processed, in about approximately 3 weeks of work. To build the language model were used three different in-domain and out-of-domain corpora. The effects of this approach were evaluated, and results are presented. Automatic metrics, BLEU and Meteor, were used to evaluate the different systems, comparing their values with similar systems. Results show that expanding the corpus has a major positive effect in BLEU and Meteor scores and use of additional corpora (in-domain and out-of-domain) in training language model does not result in significantly different performance. The scores obtained, combined with their comparison with other systems performance and informal evaluation by humans of the sentences produced, give additional support for the capabilities of the translation based approach for fast development of data-to-text for new domains.

Cite as

José Casimiro Pereira, António J. S. Teixeira, Mário Rodrigues, Pedro Miguel, and Joaquim Sousa Pinto. Natural Transmission of Information Extraction Results to End-Users - A Proof-of-Concept Using Data-to-Text. In 6th Symposium on Languages, Applications and Technologies (SLATE 2017). Open Access Series in Informatics (OASIcs), Volume 56, pp. 20:1-20:14, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2017)

Copy BibTex To Clipboard

@InProceedings{pereira_et_al:OASIcs.SLATE.2017.20,
  author =	{Pereira, Jos\'{e} Casimiro and Teixeira, Ant\'{o}nio J. S. and Rodrigues, M\'{a}rio and Miguel, Pedro and Pinto, Joaquim Sousa},
  title =	{{Natural Transmission of Information Extraction Results to End-Users - A Proof-of-Concept Using Data-to-Text}},
  booktitle =	{6th Symposium on Languages, Applications and Technologies (SLATE 2017)},
  pages =	{20:1--20:14},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-056-9},
  ISSN =	{2190-6807},
  year =	{2017},
  volume =	{56},
  editor =	{Queir\'{o}s, Ricardo and Pinto, M\'{a}rio and Sim\~{o}es, Alberto and Leal, Jos\'{e} Paulo and Varanda, Maria Jo\~{a}o},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2017.20},
  URN =		{urn:nbn:de:0030-drops-79530},
  doi =		{10.4230/OASIcs.SLATE.2017.20},
  annote =	{Keywords: Data-to-Text, Natural Language Generation, Automatic Translation, opinions, Tourism, Portuguese}
}

@InProceedings{pereira_et_al:OASIcs.SLATE.2017.20,
  author =	{Pereira, Jos\'{e} Casimiro and Teixeira, Ant\'{o}nio J. S. and Rodrigues, M\'{a}rio and Miguel, Pedro and Pinto, Joaquim Sousa},
  title =	{{Natural Transmission of Information Extraction Results to End-Users - A Proof-of-Concept Using Data-to-Text}},
  booktitle =	{6th Symposium on Languages, Applications and Technologies (SLATE 2017)},
  pages =	{20:1--20:14},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-056-9},
  ISSN =	{2190-6807},
  year =	{2017},
  volume =	{56},
  editor =	{Queir\'{o}s, Ricardo and Pinto, M\'{a}rio and Sim\~{o}es, Alberto and Leal, Jos\'{e} Paulo and Varanda, Maria Jo\~{a}o},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2017.20},
  URN =		{urn:nbn:de:0030-drops-79530},
  doi =		{10.4230/OASIcs.SLATE.2017.20},
  annote =	{Keywords: Data-to-Text, Natural Language Generation, Automatic Translation, opinions, Tourism, Portuguese}
}

Pereira, José

Document

DOI: 10.4230/OASIcs.SLATE.2023.11

Hierarchical Data-Flow Graphs

Authors: José Pereira, Vitor Vieira, and Alberto Simões

Published in: OASIcs, Volume 113, 12th Symposium on Languages, Applications and Technologies (SLATE 2023)

Abstract

Data-Flows are crucial to detect the dependency of statements and expressions in a programming language program. In the context of Static Application Security Testing (SAST), they are heavily used in different aspects, from detecting tainted data to understanding code dependency. In Checkmarx, these data flows are currently computed on the fly, but their efficiency is not the desired, especially when dealing with large projects. With this in mind, a new caching mechanism is being developed, based on hierarchical graphs. In this document, we discuss the basic idea behind this approach, the challenges found and the decisions put in place for the implementation. We will also share the first insights on speed improvements for a proof of concept implementation.

Cite as

José Pereira, Vitor Vieira, and Alberto Simões. Hierarchical Data-Flow Graphs. In 12th Symposium on Languages, Applications and Technologies (SLATE 2023). Open Access Series in Informatics (OASIcs), Volume 113, pp. 11:1-11:9, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2023)

Copy BibTex To Clipboard

@InProceedings{pereira_et_al:OASIcs.SLATE.2023.11,
  author =	{Pereira, Jos\'{e} and Vieira, Vitor and Sim\~{o}es, Alberto},
  title =	{{Hierarchical Data-Flow Graphs}},
  booktitle =	{12th Symposium on Languages, Applications and Technologies (SLATE 2023)},
  pages =	{11:1--11:9},
  series =	{Open Access Series in Informatics (OASIcs)},
  ISBN =	{978-3-95977-291-4},
  ISSN =	{2190-6807},
  year =	{2023},
  volume =	{113},
  editor =	{Sim\~{o}es, Alberto and Ber\'{o}n, Mario Marcelo and Portela, Filipe},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/OASIcs.SLATE.2023.11},
  URN =		{urn:nbn:de:0030-drops-185252},
  doi =		{10.4230/OASIcs.SLATE.2023.11},
  annote =	{Keywords: Data Flow, Static Application Security Testing, Hierarchical Graphs}
}

Document

DOI: 10.4230/LIPIcs.OPODIS.2019.11

Minha: Large-Scale Distributed Systems Testing Made Practical

Authors: Nuno Machado, Francisco Maia, Francisco Neves, Fábio Coelho, and José Pereira

Published in: LIPIcs, Volume 153, 23rd International Conference on Principles of Distributed Systems (OPODIS 2019)

Abstract

Testing large-scale distributed system software is still far from practical as the sheer scale needed and the inherent non-determinism make it very expensive to deploy and use realistically large environments, even with cloud computing and state-of-the-art automation. Moreover, observing global states without disturbing the system under test is itself difficult. This is particularly troubling as the gap between distributed algorithms and their implementations can easily introduce subtle bugs that are disclosed only with suitably large scale tests. We address this challenge with Minha, a framework that virtualizes multiple JVM instances in a single JVM, thus simulating a distributed environment where each host runs on a separate machine, accessing dedicated network and CPU resources. The key contributions are the ability to run off-the-shelf concurrent and distributed JVM bytecode programs while at the same time scaling up to thousands of virtual nodes; and enabling global observation within standard software testing frameworks. Our experiments with two distributed systems show the usefulness of Minha in disclosing errors, evaluating global properties, and in scaling tests orders of magnitude with the same hardware resources.

Cite as

Nuno Machado, Francisco Maia, Francisco Neves, Fábio Coelho, and José Pereira. Minha: Large-Scale Distributed Systems Testing Made Practical. In 23rd International Conference on Principles of Distributed Systems (OPODIS 2019). Leibniz International Proceedings in Informatics (LIPIcs), Volume 153, pp. 11:1-11:17, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2020)

Copy BibTex To Clipboard

@InProceedings{machado_et_al:LIPIcs.OPODIS.2019.11,
  author =	{Machado, Nuno and Maia, Francisco and Neves, Francisco and Coelho, F\'{a}bio and Pereira, Jos\'{e}},
  title =	{{Minha: Large-Scale Distributed Systems Testing Made Practical}},
  booktitle =	{23rd International Conference on Principles of Distributed Systems (OPODIS 2019)},
  pages =	{11:1--11:17},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-133-7},
  ISSN =	{1868-8969},
  year =	{2020},
  volume =	{153},
  editor =	{Felber, Pascal and Friedman, Roy and Gilbert, Seth and Miller, Avery},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops-dev.dagstuhl.de/entities/document/10.4230/LIPIcs.OPODIS.2019.11},
  URN =		{urn:nbn:de:0030-drops-117979},
  doi =		{10.4230/LIPIcs.OPODIS.2019.11},
  annote =	{Keywords: Distributed software testing, Large scale distributed systems, Simulation}
}

Search Results

Documents authored by Pereira, José Casimiro

Pereira, José Casimiro

Natural Transmission of Information Extraction Results to End-Users - A Proof-of-Concept Using Data-to-Text

Abstract

Cite as

Pereira, José

Hierarchical Data-Flow Graphs

Abstract

Cite as

Minha: Large-Scale Distributed Systems Testing Made Practical

Abstract

Cite as

Thanks for your feedback!

Could not send message